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Abstract — Can intelligence optimise Digital Ecosystems? How 
could a distributed intelligence interact with the ecosystem 
dynamics? Can the software components that are part of 
genetic selection be intelligent in themselves, as in an adaptive 
technology? We consider the effect of a distributed intelligence 
mechanism on the evolutionary and ecological dynamics of 
our Digital Ecosystem, which is the digital counterpart of 
a biological ecosystem for evolving software services in a 
distributed network. We investigate Neural Networks (NNs) 
and Support Vector Machine (SVM) for the learning based 
pattern recognition functionality of our distributed intelligence. 
Simulation results imply that the Digital Ecosystem performs 
better with the application of a distributed intelligence, marginally 
more effectively when powered by SVM than NNs, and suggest 
that it can contribute to optimising the operation of our Digital 
Ecosystem. 

Index Terms- evolution, ecosystem, migration, intelligence, 
distributed 



I. Introduction 

With a Digital Ecosystem, being the digital counterpart of 
a biological ecosystem for evolving software services in a dis- 
tributed network, can we answer the following questions; Can 
intelligence optimise the evolutionary process? How could a 
distributed intelligence interact with the ecosystem dynamics? 
Can the software components that are part of genetic selection 
be intelligent in themselves, as in an adaptive technology? 
These are wide ranging questions, and we have started by 
considering a distributed intelligence based on a simple social 
interaction mechanism that leads to targeted migration. We 
will use a machine learning technique to power our distributed 
intelligence, considering both NNs and SVM. We will start 
with a brief reminder for our definition of Digital Ecosystems. 

Our Digital Ecosystem [1] is the digital counterpart of 
a natural ecosystem [2], [3], which automates the search 
for new algorithms in a scalable architecture, through the 
evolution of software services in a distributed network. In the 
Digital Ecosystem, local and global optimisations concurrently 
operate to determine solutions to satisfy different optimisation 
problems. The global optimisation here is not a decentralised 
super-peer based control mechanism [4], but the completely 
distributed peer-to-peer network of the interconnected habitats. 
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Fig. 1. Digital Ecosystem: Optimisation architecture in which agents travel 
along the peer-to-peer connections; in every node (habitat) local optimisation 
is performed through an evolutionary algorithm, where the search space is 
determined by the agents present at the node. 



which are therefore not susceptible to the failure of super- 
peers. This is a novel optimisation technique inspired by bio- 
logical ecosystems, working at two levels: a first optimisation, 
migration of agents which are distributed in a peer-to-peer 
network, operating continuously in time; this process feeds 
a second optimisation, based on evolutionary combinatorial 
optimisation, operating locally on single peers and is aimed at 
finding solutions that satisfy locally relevant constraints. So, 
the local search is improved through this twofold process to 
yield better local optima faster, as the distributed optimisation 
provides prior sampling of the search space through computa- 
tions already performed in other peers with similar constraints 
[5]. The services consist of an executable component and 
a descriptive semantic component. Analogous to the way in 
which an agent is capable of execution and has an ontological 
description. So, if the services are modelled as software agents 
[6], then the Digital Ecosystem can be considered a Multi- 
Agent System (MAS) which uses distributed evolutionary 
computing to combine suitable agents available to meet user 
requests for applications. 

The motivation for using parallel or distributed evolutionary 
algorithms is twofold: first, improving the speed of evolu- 
tionary processes by conducting concurrent evaluations of 
individuals in a population; second, improving the problem- 
solving process by overcoming difficulties that face traditional 
evolutionary algorithms, such as maintaining diversity to avoid 
premature convergence [7], [8]. The fact that evolutionary 



computing manipulates a population of independent solutions 
actually makes it well suited for parallel and distributed 
computation architectures [9]. There are several variants of 
distributed evolutionary computing, leading some to propose 
a taxonomy for their classification [10], with there being two 
main forms [9], [8]: multiple-population/coarse-grained migra- 
tion/island models [11], [9], and single-population/fine-grained 
diffusion/neighbourhood models [12], [8]. Fine-grained dif- 
fusion models [12], [8] assign one individual per processor. 
A local neighbourhood topology is assumed, and individuals 
are allowed to mate only within their neighbourhood, called a 
deme^ . The demes overlap by an amount that depends on their 
shape and size, and in this way create an implicit migration 
mechanism. Each processor runs an identical evolutionary 
algorithm which selects parents from the local neighbourhood, 
produces an offspring, and decides whether to replace the 
current individual with an offspring. In the coarse-grained 
island models [11], [9], evolution occurs in multiple parallel 
sub-populations (islands), each running a local evolutionary 
algorithm, evolving independently with occasional migrations 
of highly fit individuals among sub-populations. This model 
has also been used successfully in the determination of invest- 
ment strategies in the commercial sector, in a product known 
as the Galapagos toolkit [14], [15]. However, all the islands in 
this approach work on exactly the same problem, which makes 
it less analogous to biological ecosystems in which different 
locations can be environmentally different [16]. 

The landscape, in energy-centric biological ecosystems, 
defines the connectivity between habitats [16]. Connectivity 
of nodes in the digital world is generally not defined by 
geography or spatial proximity, but by information or semantic 
proximity. For example, connectivity in a peer-to-peer network 
is based primarily on bandwidth and information content, and 
not geography. The island-models of distributed evolutionary 
computing use an information-centric model for the connec- 
tivity of nodes {islands) [11]. However, because it is generally 
defined for one-time use (to evolve a solution to one problem 
and then stop) it usually has a fixed connectivity between 
the nodes, and therefore a fixed topology [9]. So, supporting 
evolution in the Digital Ecosystem, with a dynamic multi- 
objective selection pressure (fitness landscape [17] with many 
peaks), requires a re-configurable network topology, such that 
habitat connectivity can be dynamically adapted based on the 
observed migration paths of the agents between the users 
within the habitat network. So, based on the island-models 
of distributed evolutionary computing [11], each connection 
between the habitats is bi-directional and there is a probability 
associated with moving in either direction across the connec- 
tion, with the connection probabilities affecting the rate of 
migration of the agents. However, additionally, the connection 
probabilities will be updated by the success or failure of 
agent migration using the concept of Hebbian learning [18]: 
the habitats which do not successfully exchange agents will 

^In biology a deme is a term for a local population of organisms of one 
species that actively interbreed with one another and share a distinct gene 
pool [13]. 



become less strongly connected, and the habitats which do 
successfully exchange agents will achieve stronger connec- 
tions. This leads to a topology that adapts over time, resulting 
in a network that supports and resembles the connectivity of 
the user base, typically small- world networks [19], [20]. They 
have many strongly connected clusters (communities), called 
sub-networks (quasi-complete graphs), with a few connections 
between these clusters (conmiunities) [21]. Graphs with this 
topology have a very high clustering coefficient and small 
characteristic path lengths [21], as visualised in Figure 1. 
The novelty comes from the creation of multiple evolving 
populations in response to similar requests, whereas in the 
island-models of distributed evolutionary computing there 
are multiple evolving populations in response to only one 
request [11]. So, in our Digital Ecosystem different requests 
are evaluated on separate islands (populations), with their 
evolution accelerated by the sharing of solutions between the 
evolving populations (islands), because they are working to 
solve similar requests (problems). 

The users will formulate queries to the Digital Ecosystem 
by creating a request as a semantic description, like those 
being used and developed in Service-Oriented Architectures 
[22], specifying an application they desire and submitting it 
to their habitat. This description enables the definition of a 
metric for evaluating the fitness of a composition of agents, 
as a distance function between the semantic description of the 
request and the agents' semantic descriptions. A population is 
then instantiated in the user's habitat in response to the user's 
request, seeded from the agents available at their habitat (i.e. 
its agent-pool). This allows the evolutionary optimisation to 
be accelerated in the following three ways: first, the habitat 
network provides a subset of the agents available globally, 
which is localised to the specific user it represents; second, 
making use of agent- sequences previously evolved in response 
to the user's earlier requests; and third, taking advantage of 
relevant agent- sequences evolved elsewhere in response to 
similar requests by other users. The population then proceeds 
to evolve the optimal agent-sequence(s) that fulfils the user 
request, and as the agents are the base unit for evolution, it 
searches the available agent-sequence combination space. For 
an evolved agent- sequence that is executed (instantiated) by 
the user, it then migrates to other peers (habitats) becoming 
hosted where it is useful, to combine with other agents in other 
populations to assist in responding to other user requests for 
applications. 

II. Distributed Intelligence 

Potential exists to optimise the distribution of the agents 
within the habitat network, through additional targeted migra- 
tion of the agents, which will indirectly optimise the evolv- 
ing agent populations. The migration probabilities between 
the habitats produces the existing passive agent migration, 
allowing the agents to spread in the correct general direction 
within the habitat network, based primarily upon success at 
their current location. This augmentation will work in a more 
active manner, allowing the agents highly targeted migration to 
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Fig. 2. User Request to the Digital Ecosystem (modified from [23]): A user 
will formulate queries to the Digital Ecosystem by creating a request as a 
semantic description, like those being used and developed in Service-Oriented 
Architectures [22], specifying an application they desire and submitting it to 
their habitat. A population is then instantiated in the user's habitat in response 
to the user's request, seeded from the agents available at their habitat (agent- 
pool). 



specific habitats, in addition to the generally directed passive 
migration. It will help to optimise the agents found at the 
agent-pools of the habitats, which will in turn optimise the 
evolving agent populations as they make use of the agent- 
pools when determining applications (agent-sequences) to user 
requests. So, accelerating the process of ecological succession 
[16], and therefore the responsiveness of the Digital Ecosystem 
to the user base. 

The targeted migration will work by providing the agents 
with the opportunity to interact inside the agent-pools, outside 
of the evolutionary optimisation of the evolving agent popu- 
lations, to determine if they are functionally similar based on 
their semantic descriptions. Similar agents will compare their 
migration histories to determine habitats where they could 
find a niche (i.e. be valuable). This will lead to additional 
highly targeted migration of agents throughout the habitat 
network, optimising the set of agents and agent-sequences 
at the agent-pools, and therefore indirectly optimising and 
accelerating the evolving agent populations within the Digital 
Ecosystem. As each evolving agent population within the 
Digital Ecosystem will be accelerated, the entire ecosystem 
will operate more efficiently. The targeted migration will 
strengthen the agent concept within the Ecosystem-Oriented 
Architecture of Digital Ecosystems, endowing the individual 
agents with some intelligence and control over their behaviour. 
Interestingly, the effectiveness of this augmentation relies on 
the local interactions of the agents, producing an emergent 
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Fig. 3. Targeted Migration: This augmentation will optimise the distribution 
of the agents within the habitat network, through additional targeted migration 
of the agents, helping to optimise the agents found at the agent-pools of 
the habitats, which will in turn optimise the evolving agent populations. 
So, accelerating the process of ecological succession [16], and therefore the 
responsiveness of the Digital Ecosystem to the user base. 



global optimising effect on the evolving agent populations to 
accelerate the ecological succession of a Digital Ecosystem. 

The targeted migration will directly optimise the ecological 
migration, and therefore indirectly complement the evolu- 
tionary self-organisation of the evolving agent populations, 
through the highly targeted migration of the agents to their 
niche habitats. The migration probabilities between the habi- 
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Fig. 4. Agent Life-Cycle With Targeted Migration: The agent life-cycle 
will change to support the targeted migration, as shown by the blue circle. 
Specifically, there will be more opportunities for agent migration, but more 
importantly these opportunities will be for targeted migration, which will help 
to optimise the set of agents found at the habitats. 



tats produces the existing passive agent migration, allowing 
the agents to spread in the correct general direction within 
the habitat network, based primarily upon success at their 
current location. The targeted migration will work in a more 
active manner, allowing the agents highly targeted migration to 
specific habitats, based upon their interaction with one another 
to discover habitats where they could be valuable (i.e. find a 
niche). 

The targeted migration will occur when users deploy 
their services, specifically when deploying their representative 
agents to their habitats within the Digital Ecosystem, and upon 
the execution of applications (groups of services), specifically 
the resulting passive migration of their representative agent- 
sequences between the habitats. The agent-sequences arriving 
at habitats, with respect to the targeted migration, will be 
treated as individual agents arriving at the habitats. So, an 
agent arriving at a habitat interacts one-on-one with agents 
already present within the agent-pool of the habitat, and upon 
determining functional similarity, based upon comparing their 
semantic descriptions, will share other habitats successfully 
visited from their respective migration histories. An agent 
migration history is the migratory path of the agent through 
the habitat network, including its use at the habitats visited. So, 
similar agents can share their migration histories to discover 
new habitats where they could be valuable, and then use 
targeted migration (via a copy, and not a move) to explore 
the most promising of the recently acquired habitats. This will 
allow successfully interacting agents to target specific habitats 
where they will potentially be useful, but risks potentially 
infinite targeted migration, because targeted migration itself 
can lead to further targeted migration. So, each agent will 
require a dynamic targeted migrations counter, which defines 
the number of permitted targeted migrations of the agent. 
This counter will be incremented upon an agent's execution in 
response to a user request, and decremented upon performing 
a targeted migration. 

The agent life-cycle will change to support the targeted 



migration, as shown in Figure 4 by the blue circle. Specifically, 
there will be more opportunities for agent migration, but more 
importantly these opportunities will be for targeted migration, 
which will help to optimise the set of agents found at the 
habitats, and therefore support the evolving agent populations 
created in response to user requests for applications. The 
targeted migration will essentially short-circuit the hierar- 
chical topology habitat network, which is what allows it 
to specialise and localise to communities, providing specific 
solutions to specific requests from specific users. However, the 
targeted migration will also reinforce the hierarchical topology 
of the habitat network, because targeted migration between 
connected habitats will accelerate the existing migration of 
agents, while between unconnected habitats will assist the 
Digital Ecosystem in supporting emerging communities. So, 
the targeted migration will help strengthen and catalyse the 
formation of clusters within the habitat network, and will also 
assist in locating habitats within the correct clusters. Therefore, 
the optimisation of the Digital Ecosystem will be a global 
emergent effect resulting from the local interactions of the 
agents, allowing for niches to be fulfilled faster and so ac- 
celerating the process of ecological succession [16]. Also, the 
Digital Ecosystem will adapt faster to changing environmental 
conditions (e.g. changes in the request behaviour of user com- 
munities). In biological terms the targeted migration endows 
the agents with a form of reciprocal altruistic behaviour [24], 
consistent with the agent paradigm of the Ecosystem-Oriented 
Architecture. 

A. Similarity Recognition 

For the targeted migration to work successfully an effec- 
tive technique will be required for the similarity recognition 
between the semantic descriptions of two agents. Each agent 
will have an embedded similarity recognition component to 
maintain the consistency of the agent paradigm of Ecosystem- 
Oriented Architectures. So, the agents will interact one-on-one 
to determine functional similarity based upon their semantic 
descriptions, using their embedded similarity recognition com- 
ponents, with each of the two interacting agents determining 
similarity for themselves. Again, this is to maintain the con- 
sistency of the agent paradigm. Similarity recognition between 
the semantic descriptions of two agents will require some form 
of pattern recognition, because there is no single standard for 
the semantic description of services [25], and we would not 
adopt any one over the others as it would be inconsistent with 
the inclusive nature of Digital Ecosystems. So, we will now 
consider the field of pattern recognition to determine suitable 
techniques for the similarity recognition components to be 
embedded within the agents. 

Pattern recognition aims to classify data (patterns) based 
on priori knowledge or on statistical information extracted 
from the data [26]. Pattern recognition requires a sensor or 
sensors for data acquisition, a pre-processing technique, a data 
representation scheme, and a decision making model [27]. 
Also, learning from a set of examples (training set) is an 
important and desirable feature of most pattern recognition 



systems [26]. The four best known approaches for pattern 
recognition are [27]: 

• Template Matching 

• Statistical Classification 

• Structural Matching 

• Neural Networks 

These approaches are not necessarily independent, and some- 
times the same pattern recognition method exists with different 
interpretations [27]. For example, attempts have been made to 
design hybrid systems involving multiple approaches, such as 
the notion of attributed grammars which unifies structural and 
statistical pattern recognition [28]. 

Template Matching is not suitable for the required pattern 
recognition of our targeted migration, because its effective 
use is domain specific [27] and the similarity recognition 
between the semantic descriptions of agents is very different 
to the domains that it is typically applied [29]. Statistical 
Classification is also not suitable, because the embedded 
similarity recognition component of each agent would require 
human intervention for variable selection and transformation 
[30]. Structural Matching is suitable theoretically, but im- 
plementations lead to many difficulties [27], including the 
segmentation of noisy patterns (to detect primitives) and the 
inference of grammar from training data [27]. There can also 
be a combinatorial explosion of possibilities to be investigated, 
demanding large training sets and significant computational 
effort [31], neither of which is available. Neural Networks 
are suitable, given their low dependence on domain-specific 
knowledge and the availability of efficient learning algorithms 
[27]. Support Vector Machines, albeit a recent development 
[32], are also suitable [33], being primarily a binary classifier 
[27] for training generalisable nonlinear classifiers in high- 
dimensional spaces using small training sets [34]. So, we 
will make use of both Neural Networks and Support Vector 
Machines for our targeted migration. 

1 ) Neural Networks: In the first instance, we will leverage 
the pattern recognition capabilities of Neural Networks (NNs) 
for the embedded similarity recognition components of the 
agents, allowing them to determine similarity to one another 
based on the similarity of their semantic descriptions. We 
will use multilayer perceptrons (feed-forward artificial NNs) 
with backpropagation [35] to provide the required pattern 
recognition behaviour, because of their ability to solve prob- 
lems stochastically, which allows for approximate solutions 
to extremely complex problems [35]. They are a modification 
of the standard linear perceptron [36], using three or more 
layers of neurons (nodes) with nonlinear activation functions 
to distinguish data that is not linearly separable, or separable 
by a hyperplane [37]. The power of the multilayer perceptron 
comes from its similarity to certain biological neural networks 
in the human brain, and because of their wide applicability 
has become the standard algorithm for any supervised-leaming 
pattern recognition process [35]. 

NNs can be viewed as massively parallel computing systems 
consisting of an extremely large number of simple processors 
with many interconnections [26]. NN models attempt to use 



certain organisational principles (such as learning, generalisa- 
tion, adaptivity, fault tolerance, distributed representation, and 
computation) in a network of weighted directed graphs, in 
which the nodes are artificial neurons, and the directed edges 
(with weights) are connections between the neuron outputs 
and inputs [26]. The main characteristics of NNs are their 
ability to learn complex nonlinear input-output relationships, 
use sequential training procedures, and adapt themselves to the 
data [27]. The most conmionly used fanuly of NNs for pattern 
classification tasks is the feed-forward network, including 
multilayer perceptrons, which are organised into layers and has 
unidirectional connections between the layers [27]. Another 
popular network is the Self-Organising Map, or Kohonen- 
Network [38], which is often used for feature mapping [27]. 
The increasing popularity of NN models to solve pattern 
recognition problems has been primarily because of their 
low dependence on domain- specific knowledge (relative to 
model-based and rule-based approaches) and the availability 
of efficient learning algorithms [27]. The learning process 
involves updating the network architecture and connection 
weights so that a network can efficiently perform a specific 
classification [26]. NNs provide a suite of nonlinear algorithms 
for feature extraction (using hidden layers) and classification 
(e.g. multilayer perceptrons) [27]. In addition, existing feature 
extraction and classification algorithms can be mapped onto 
NN architectures for efficient (hardware) implementation [39]. 
Despite the seemingly different underlying principles, most 
of the well-known NN models are implicitly equivalent or 
similar to classical statistical pattern recognition methods [27]. 
However, NNs offer several advantages, such as unified ap- 
proaches for feature extraction and classification, and flexible 
procedures for finding good, moderately nonlinear solutions 
[27]. 

A pre-processing [37] of agent semantic descriptions will be 
required that is consistent across the entire Digital Ecosystem, 
requiring an alphabetical ordering of the attribute tuples within 
a semantic description, a standardisation of the length of the 
attributes, before finally making use of a binary encoding 
for processing by a NN [37]. The assumption of information 
structured as tuples, including an attribute name and attribute 
value, is accurate for our simulated semantic descriptions, but 
is also a reasonable assumption for any semantic description 
of web services [40], [41], [42], [25]. To standardise the length 
of the attributes, after removing any white-space^, an average 
word length of six characters will be used, because 5.39 is 
the average word length for business English [44]. For the 
binary encoding we propose using Unicode (UTF-8), which 
is based on extending ASCII to provide multilingual support 
[45]. However, ASCII's support of only EngHsh [45] will be 
sufficient for our simulations. The size (number of neurons) 
of the input layer [35] will be proportional to the semantic 
description of the agent in which the NN is embedded, taking 
advantage of the variation in length of different semantic de- 

white-space is any single character or series of characters that represent 
horizontal or vertical space in typography [43]. 
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Fig. 5. Neural Network for the Similarity Recognition Component of 
agents in Targeted Migration: Consisting of an input layer proportional to the 
semantic description of the agent in which it is embedded. A single hidden 
layer, and an output layer consisting of a single neurone to provide a binary 
response to the question of whether another agent's semantic description is 
similar 



scriptions, which will assist the NN-based pattern recognition 
in determining dissimilarity. 

We will use a single hidden layer, which is usually sufficient 
for most tasks [37]. The size of which will be determined 
through exploratory programming [46] in our simulations, be- 
cause of the difficulty in determining the optimal size without 
training several networks and estimating the generalisation 
error [47], evident by the range of inconsistent rules of thumb 
[48], [49], [50], [51] available to define the optimal size. The 
output layer [37] will consist of a single neurone to provide 
a binary (true or false) response to the question of whether 
another agent's semantic description is similar to the agent's 
own semantic description. We will use a threshold of 0.90 
on its output for the determination of similarity. The overall 
structure of the Neural Network is visualised in Figure 5. 

Multilayer perceptrons use nonlinear activation functions, 
which were developed to model the frequency of action 
potentials (firing) of biological neurons in the brain [35]. The 
main activation function used in current applications is the 
sigmoid function [35], a hyperbolic tangent that is normalised 
and in which the output of a neurone is the sum of the 
weighted input values x [37], 



The weights x between the neurons will be randomly ini- 
tialised, then trained to the real numbers that provide the 
desired functionality, because learning occurs in the perceptron 
by changing the connection (synaptic) weights after each piece 
of data is processed, based on the error of the output compared 
to the expected result [37]. This is an example of super- 
vised learning and is carried out through backpropagation, a 
generalisation of the least mean squares algorithm [35]. The 
network is therefore trained by providing it with input and 
corresponding output patterns [37]. 



The NN-based embedded similarity recognition component 
of an agent will be trained when the agent is deployed to a 
habitat of the Digital Ecosystem. The initial training set will 
consist of the semantic description of the agent as a positive 
match, and variants created from its own semantic description. 
If the variant is less than 10% different it will be processed 
as a positive match, else it will be processed as a negative 
match. The training set can be extended based on experience, 
making use of when an agent visits a habitat through targeted 
migration (i.e. one acquired from an inter-agent interaction); if 
visiting the habitat proves successful the semantic description 
of the interacting agent can be appended to the training set as 
a positive match, else as a negative match. 

2) Support Vector Machines: In the second instance, we 
will leverage the pattern recognition capabilities of Support 
Vector Machines (SVMs) for the embedded similarity recog- 
nition components of the agents, allowing them to determine 
similarity to one another based on the similarity of their 
semantic descriptions. As SVMs are closely related to Neural 
Networks, being a close cousin to classical multilayer percep- 
trons [52], we will make use of the pre-processing and the 
training sets defined in the previous subsection, which will 
also ensure a fair comparison of the pattern recognition tech- 
niques in empowering the similarity recognition components 
of the agents. 

One of the most interesting recent developments in clas- 
sifier design is the introduction of the SVM [32], which is 
primarily a two-class classifier, and therefore highly suitable 
for the required similarity recognition component of our 
targeted migration. It uses an optimisation criterion that is 
the width of the margin between the classes, i.e. the empty 
area around the decision boundary defined by the distance 
to the nearest training patterns [53]. These patterns, called 
support vectors, define the classification function, and their 
number is minimised by maximising the margin [53]. This 
is achieved through a kernel function K, which transposes 
the data into a higher-dimensional space where a hyperplane 
performs the separation [53]. In its simplest form the kernel 
function is just a dot product between the input pattern and a 
member of the support set, resulting in a linear classifier, while 
nonlinear kernel functions lead to a polynomial classifier [27]. 
SVMs are closely related to Neural Networks, being a close 
cousin to classical multilayer perceptrons, with the use of a 
sigmoid kernel function making them equivalent to two-layer 
perceptrons [52]. However, in the training of NNs, such as 
multi-layer perceptrons, the weights of the network are found 
by solving a non-convex unconstrained minimisation problem, 
while the use of a kernel function in SVMs solves a quadratic 
programming problem with linear constraints [54]. An im- 
portant advantage of SVMs is that they offer the possibility 
to train generalisable nonlinear classifiers in high-dimensional 
spaces using a small training set [34]. Furthermore, for large 
training sets a small support set is typically selected for 
designing the classifier, thereby minimising the computational 
requirements during training [34]. 

The selection of a suitable kernel function is important. 
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Fig. 6. Support Vector Machine (modified from [57]): Visualisation 
showing the training set in the Input Space, and its binary classification by 
a hyperplane in the higher dimensional Feature Space, achieved through the 
kernel function. A Radial Basis Functions (RBF) is recommended for text 
categorisation [33], with the most common form of the RBF being Gaussian 
[56]. 

since it defines the feature space in which the training set 
is classified [55], operating as shown in Figure 6. A Radial 
Basis Functions (RBF) is recommended for text categorisation 
[33], with the most common form of the RBF being Gaussian 
[56]. 

Training a SVM requires solving a large quadratic 
programming (QP) optimisation problem, which Sequential 
Minimal Optimisation (SMO) breaks into a series of the 
smallest possible QP problems. SMO solves these small QP 
problems analytically, which avoids using a time-consuming 
numerical QP optimisation. SMO scales between linear and 
quadratic time complexity, relative to the size of the training 
set, because it avoids matrix computation [58]. The alternative, 
a standard Projected Conjugate Gradient (PCG) chunking 
algorithm scales between linear and cubic time complexity, 
relative to the size of the training set [58]. So SMO is faster, 
up to a thousand times on real-world sparse data sets [58]. 

The issue of the learnt behaviour of the embedded similarity 
recognition component of an agent, whether SVM or NN 
based, being inherited when the agent reproduces is known as 
the Baldwin effect [59]. The Baldwin effect has always been 
controversial within biological ecosystems [60], primarily be- 
cause of the problem of confirming it experimentally [61]. 
Also, offspring in biological ecosystems can be genetically 
different to their parents [16], such that any learnt behaviour 
could potentially be inappropriate. However, the offspring in 
our Digital Ecosystem are genetically identical to their parents 
(in terms of the individual agents), and so it makes little sense 
to force the loss of learnt behaviour. So, we doubt that the 
Baldwin effect will adversely affect our Digital Ecosystem, 
which we will confirm through our simulations. 

Now that the targeted migration is theoretically complete, 
with two alternative pattern recognition techniques, we can 
confirm its effect experimentally through simulations. 

III. Simulation and Results 

Although agent-based modelling solutions, like Repast (Re- 
cursive Porous Agent Simulation Toolkit) [62] and MASON 
(Multi- Agent Simulator Of Neighbourhoods) [63], and evolu- 
tionary computing libraries, like EC J (Evolutionary Comput- 



ing in Java) [64] and the JCLEC (Java Computing Library 
for Evolutionary Computing) [65], are available, it was ev- 
ident that it would take as much effort to adapt one, or a 
combination, of these to simulate the Digital Ecosystem, as it 
would to create our own simulation of the Digital Ecosys- 
tem, because the required ecological dynamics are largely 
absent from these and other available technologies. So, we 
created our own simulation, following the Ecosystem-Oriented 
Architecture from section I, using the business ecosystem of 
Small and Medium sized Enterprises from Digital Business 
Ecosystems [66] as an example user base, adding the classes 
and methods necessary to implement the proposed targeted 
migration augmentation. Each experimental scenario was run 
ten thousand times for statistical significance of the means and 
standard deviations calculated. We implemented the targeted 
migration as defined in section II, using both Neural Network 
(NN) and Support Vector Machine (SVM) based similarity 
recognition components embedded within the agents. 

A. Agents: Semantic Descriptions 

A = {(1,25), (2,35), (3,55), (4,6), (5,37), (6,12)} 

Fig. 7. Agent Semantic Descriptions: Each simulated agent had a semantic 
description with an abstract representation consisting of a set of between three 
and six numeric tuples; each tuple representing an attribute of the semantic 
description, one integer for the attribute identifier and one for the attribute 
value, with both ranging between one and a hundred. 

An agent represents a user's service, including the semantic 
description of the business process involved, and is based on 
existing and emerging technologies for semantically capable 
Service-Oriented Architectures [22], such as the OWL-S se- 
mantic markup for web services [67]. We simulated a service's 
semantic description with an abstract representation consisting 
of a set of numeric tuples, to simulate the properties of a 
semantic description. Each tuple representing an attribute of 
the semantic description, one integer for the attribute identifier 
and one for the attribute value, with both ranging between 
one and a hundred. Each simulated agent had a semantic 
description, with between three and six tuples, as shown in 
Figure 7. 

B. User Base 

R = [{(1,23),(2,45),(3,33),(4,6),(5,8),(6,16)}, 
{(1,84),(2,48),(3,53),(4,11),(5,16)}] 

Fig. 8. User Request: A simulated user request consisted of an abstract 
semantic description, as a list of sets of numeric tuples to represent the 
properties of a desired business application; each tuple representing an 
attribute of the semantic description, one integer for the attribute identifier 
and one for the attribute value, with both ranging between one and a hundred. 

Throughout the simulations we assumed a hundred users, 
which meant that at any time the number of users joining 
the network equalled those leaving. The habitats of the users 
were randomly connected at the start, to simulate the users 
going online for the first time. The users then produced agents 
(services) and requests for business applications. Initially, 



the users each deployed five agents to their habitats, for 
migration (distribution) to any habitats connected to theirs 
(i.e. their community within the business ecosystem). Users 
were simulated to deploy a new agent after the submission 
of three requests for business applications, and were chosen 
at random to submit their requests. A simulated user request 
consisted of an abstract semantic description, as a list of sets of 
numeric tuples to represent the properties of a desired business 
application. The use of the numeric tuples made it comparable 
to the semantic descriptions of the services represented by the 
agents; while the list of sets (two level hierarchy) and a much 
longer length provided sufficient complexity to support the 
sophistication of business applications. An example is shown 
in Figure 8. 

The user requests were handled by the habitats instantiating 
evolving populations, which used evolutionary computing to 
find the optimal solution(s), agent-sequence(s). It was assumed 
that the users made their requests for business applications 
accurately, and always used the response (agent- sequence) 
provided. 

C. Populations: Evolution 

Populations of agents, [Ai, Ai, A2, ...], were evolved to 
solve user requests, seeded with agents and agent-sequences 
from the agent-pool of the habitats in which they were 
instantiated. A dynamic population size was used to en- 
sure exploration of the available combinatorial search space, 
which increased with the average length of the population's 
agent-sequences. The optimal combination of agents (agent- 
sequence) was evolved to the user request R, by an artificial 
selection pressure created by a fitness function generated from 
the user request R. An individual (agent- sequence) of the 
population consisted of a set of attributes, ai,a2,..., and a 
user request essentially consisted of a set of required attributes, 
ri,r2, .... So, the fitness function for evaluating an individual 
agent-sequence A, relative to a user request R, was 

fitness{A,R) = . ^ , r, (2) 

where a is the member of A such that the difference to the 
required attribute r was minimised. Equation 2 was used to 
assign fitness values between 0.0 and 1.0 to each individual of 
the current generation of the population, directly affecting their 
ability to replicate into the next generation. The evolutionary 
computing process was encoded with a low mutation rate, a 
fixed selection pressure and a non-trapping fitness function 
(i.e. did not get trapped at local optima). The type of selection 
used fitness-proportional and non-elitist, fitness-proportional 
meaning that the fitter the individual the higher its probability 
of surviving to the next generation [68]. Non-elitist means that 
the best individual from one generation was not guaranteed to 
survive to the next generation; it had a high probability of 
surviving into the next generation, but it was not guaranteed 
as it might have been mutated [69] . Crossover (recombination) 
was then applied to a randomly chosen 10% of the surviving 
population, a one-point crossover, by aligning two parent 



individuals and picking a random point along their length, 
and at that point exchanging their tails to create two offspring 
[69]. Mutations were then applied to a randomly chosen 10% 
of the surviving population; point mutations were randomly 
located, consisting of insertions (an agent was inserted into 
an agent- sequence), replacements (an agent was replaced in 
an agent- sequence), and deletions (an agent was deleted from 
an agent- sequence) [70]. The issue of bloat was controlled by 
augmenting the fitness function with a parsimony pressure [71] 
which biased the search to shorter agent-sequences, evaluating 
longer than average length agent- sequences with a reduced 
fitness, and thereby providing a dynamic control limit which 
adapted to the average length of the ever-changing evolving 
agent populations. 

D. Semantic Filter 

Agent's semantic description: 
{(1,25), (2,35), (3,55), (4,6), (5,37), (6,12)} 

(with semantic filter): 
{(Business, Airline), (Company, British 
Midland), (Quality, Economy), (Cost, 60), 
(Depart, Edinburgh), (Arrive, London)} 

user request: 

[{(1,23), (2,45), (3,33), (4,6), (5,8), (6,16)}, {(1,84), 
(2,48), (3,53), (4,11), (7,16), (8,34)}, {(1,23), (2,45), 
(3,53), (4,6), (5,16)(6,53)}, {(1,86), (2,48), (3,33), 
(4,25), (7,55)(8,23)}, {(1,25), (2,52), (3,53), (4,5), 
(5,55), (6,37)}, {(1,86), (2,48), (3,43), (4,25), (7,37), 
(8,40)}, {(1,22), (2,77), (3,82), (4,9), (5,35), (6,8)}] 

(with semantic filter): 
[{(Business, Airline), (Company, Air France), 
(Quality, Economy), (Cost, 60), (Depart, Ed- 
inburgh), (Arrive, Paris)}, {(Business, Hotel), 
(Company, Continental), (Quality, 3*), (Cost, 
110), (Location, Paris), (Nights, 3)}, {(Business, 
Airline), (Company, Air France), (Quality, Econ- 
omy),(Cost,60),(Depart, Paris), (Arrive, Monte 
Carlo)}, {(Business, Hotel), (Company, Conti- 
nental), (Quahty, 2*), (Cost, 250), (Location, 
Monte Carlo), (Nights, 2)}, {(Business, Air- 
line), (Company, KLM), (Quality, Economy), 
(Cost, 50), (Depart, Monte Carlo), (Arrive, Lon- 
don)}, {(Business, Hotel), (Company, Continen- 
tal), (Quahty, 3*), (Cost, 250), (Location, Lon- 
don), (Nights, 4)}, {(Business, Airline), (Com- 
pany, Air Espana), (Quality, First), (Cost, 90), 
(Depart, London), (Arrive, Edinburgh)}] 

Fig. 9. Semantic Filter: Shows the numerical semantic descriptions, of the 
simulated services (agents) and user requests, in a human readable form. The 
semantic filter translates numerical semantic descriptions for one community 
within the user base, showing it in the context of the travel industry. The 
simulation still operated on the numerical representation for operational 
efficiency, but the semantic filter essentially assigns meaning to the numbers. 



The simulation of the Digital Ecosystem complies with 
the Ecosystem-Oriented Architecture defined in the previous 
section, but there was the possibility of model error in the 
business ecosystems of the user base (Small and Medium sized 
Enterprises from Digital Business Ecosystems [66]), because 
while the abstract numerical definition for the simulated se- 
mantic descriptions, of the services and requests the users 
provide, makes it widely applicable, it was unclear that it 
could accurately represent business services. So we created 
a semantic filter to show the numerical semantic descriptions, 
of the simulated services (agents) and user requests, in a 
human readable form. The basic properties of any business 
process are cost, quality, and time [72]; so this was followed 
in the semantic filter. The semantic filter translates numerical 
semantic descriptions for one community within the user base, 
showing it in the context of the travel industry, as shown 
in Figure 9. The simulation still operated on the numerical 
representation for operational efficiency, but the semantic filter 
essentially assigns meaning to the numbers. The output from 
the semantic filter, in Figure 9, shows that the numerical 
semantic descriptions are a reasonable modelling assumption 
that abstracts sufficiently rich textual descriptions of business 
services. 

E. Controls 

The targeted migration was dependent on additional agent 
migration, which alone could have been responsible for any 
observed optimisation, because it led to greater distribution of 
the agents within the Digital Ecosystem, potentially improving 
responsiveness for the user base. So, we included a migration 
control in our experimental simulations for the additional agent 
migration, being random instead of targeted. Furthermore, to 
determine the contribution of the NNs and SVMs on the tar- 
geted migration we created a pattern recognition control, using 
a rudimentary distance function adapted from our simulated 
fitness function defined in section III-C. 

In Figure 10 we graphed for the simulation runs the average 
of the percentage response rate after a thousand time steps 
(user request events), for the Digital Ecosystem with the 
migration control, and the Digital Ecosystem with the pat- 
tern recognition control, compared to the Digital Ecosystem 
alone. The Digital Ecosystem alone averaged a 68.0% (3 s.f.) 
response rate with a standard deviation of 2.61 (2 d.p.), while 
the Digital Ecosystem with the migration control showed a sig- 
nificant degradation to 49.6% (3 s.f.) with a standard deviation 
of 1.96 (2 d.p.), and the Digital Ecosystem with the pattern 
recognition control showed only a small increase to 70.5% (3 
s.f.) with a standard deviation of 2.60 (2 d.p.). Therefore, any 
observed improvement from the targeted migration was not 
from the additional migration but its targeting, and that the 
effectiveness of the pattern recognition functionality will be 
significant if the targeted migration is to be effective. 

In Figure 11 we graphed a typical run of the Digital 
Ecosystem with the migration control, and the Digital Ecosys- 
tem with the pattern recognition control, compared to the 
Digital Ecosystem alone [3]. The Digital Ecosystem alone 
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Fig. 10. Graph of the Targeted Migration Controls and the Digital 
Ecosystem: The Digital Ecosystem alone averaged a 68.0% response rate, 
while the Digital Ecosystem with the migration control showed a significant 
degradation to 49.6%, and the Digital Ecosystem with the pattern recognition 
control showed only a small increase to 70.5%. 
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Fig. 11. Graph of Typical Runs for the Targeted Migration Controls 
and the Digital Ecosystem: The Digital Ecosystem alone performed as 
expected, adapting and improving over time to reach a mature state. The 
migration control with additional random migration ultimately decreased the 
responsiveness, while the pattern recognition control performed only slightly 
better 
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Fig. 12. Graph of Neural Network and Support Vector Machine Based 
Targeted Migration: The Digital Ecosystem with the SVM-based targeted 
migration averaged a 92.8% (3 s.f.) response rate, slightly better than the 
NN-based targeted migration at 92.1% (3 s.f.), and so significantly better 
than the Digital Ecosystem alone at 68.0% (3 s.f). 
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Fig. 13. Graph of Typical Runs for the Digital Ecosystem and Targeted 
Migration: The Digital Ecosystem alone performed as expected, adapting 
and improving over time to reach a mature state through the process of 
ecological succession [16] In comparison, the Digital Ecosystem with the 
targeted migration, NN or SVM-based, showed a significant improvement. 



succession [16]. The Digital Ecosystem with the migration 
control, which included additional random migration, while 
initially beneficial, ultimately decreased the responsiveness of 
the Digital Ecosystem. Finally, the Digital Ecosystem with the 
pattern recognition control performed only marginally better 
than the Digital Ecosystem alone. 

F. Neural Networks 

We started with the NN-based targeted migration, as defined 
in section II-A.l. We made use of Joone (Java Object Oriented 
Neural Engine) [73] to implement the required NNs, and 
exploratory programming [46] to determine that a hidden layer 
1.5 times the size of the input layer was effective for the NN- 
based similarity recognition components. 

In Figure 12 we graphed for the simulation runs the average 
of the percentage response rate after a thousand time steps 
(user request events), for the Digital Ecosystem with the NN- 
based targeted migration, compared to the Digital Ecosystem 
alone. The Digital Ecosystem alone averaged a 68.0% (3 s.f.) 
response rate with a standard deviation of 2.61 (2 d.p.), while 
the Digital Ecosystem with the NN-based targeted migration 
showed a significant improvement to a 92.1% (3 s.f.) response 
rate with a standard deviation of 2.22 (2 d.p.). 

G. Support Vector Machines 

Next we considered the SVM-based targeted migration, as 
defined in section n-A.2, making use of LIB SVM (Library 
for Support Vector Machines) [74] to implement the required 
SVMs. In Figure 12 we graphed for the simulation runs the 



average of the percentage response rate after a thousand time 
steps (user request events), for the Digital Ecosystem with 
the SVM-based targeted migration, compared to the Digital 
Ecosystem with the NN-based targeted migration, and the 
Digital Ecosystem alone. The Digital Ecosystem with the 
SVM-based targeted migration averaged a 92.8% (3 s.f.) 
response rate with a standard deviation of 2.09 (2 d.p.), slightly 
better than the NN-based targeted migration at 92.1% (3 s.f.) 
with a standard deviation of 2.22 (2 d.p.), and so significantly 
better than the Digital Ecosystem alone at 68.0% (3 s.f.) with 
a standard deviation of 2.61 (2 d.p.). 

In Figure 13 we graphed typical runs of the Digital Ecosys- 
tem with the SVM-based targeted migration, the Digital 
Ecosystem with the NN-based targeted migration, and the 
Digital Ecosystem alone. The Digital Ecosystem alone per- 
formed as expected, adapting and improving over time to reach 
a mature state through the process of ecological succession 
[16], approaching 70% effectiveness for the user base. The 
Digital Ecosystem with the targeted migration, NN or SVM- 
based, showed a significant improvement in the ecological 
succession, reaching the same performance in less than a fifth 
of the time, before reaching over 90% effectiveness for the 
user base. To show more clearly the greater effectiveness of 
the SVM-based targeted migration, compared to the NN-based 
targeted migration, we graphed in Figure 14 the frequency of 
poor matches (<50%) every one hundred time steps, for the 
Digital Ecosystem with the SVM-based targeted migration, 
compared to the Digital Ecosystem with the NN-based tar- 
geted migration, and the Digital Ecosystem alone. 
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Fig. 14. Graph of Frequencies for the Targeted Migration: The frequency 
of poor matches (<50%) every one hundred time steps, for the Digital 
Ecosystem with the SVM-based targeted migration, compared to the Digital 
Ecosystem with the NN-based targeted migration, and the Digital Ecosystem 
alone. It shows the greater effectiveness of the SVM-based targeted migration, 
compared to the NN-based targeted migration from the seven hundredth 
generation onwards. 



IV. Conclusion 

The results showed that the targeted migration optimised 
and accelerated the ecological succession [16] of our Digital 
Ecosystem, constructively interacting with its ecological and 
evolutionary dynamics. The results also showed that it was 
not the additional migration, but its targeting that created the 
improvement in the Digital Ecosystem, and that an effective 
pattern recognition technique was required for the targeted 
migration to operate effectively. Both NNs and SVMs proved 
to be effective, SVMs marginally more than NNs. The results 
also showed that there were no adverse side-effects from the 
Baldwin effect [59], the inheritance of learnt behaviour in the 
agents from the embedded similarity recognition components, 
whether SVM or NN based. Finally, based on the experimental 
results, and our theoretical understanding, we would recom- 
mend SVMs for the pattern recognition functionality of the 
targeted migration. 

A partial reference implementation [75] for our Digital 
Ecosystem, which includes an implementation of the targeted 
migration, was created by the Digital Business Ecosystem 
(DBE) project [76], and we expect that once completed will be 
deployed as part of the software platform intended for the re- 
gional deployment of their Digital Ecosystems [77], [78]. Also, 
an open-source simulation framework for Digital Ecosystems 
[79] was created by the Digital Business Ecosystems project 
[76], and is currently supported by the Open Philosophies 
for Associative Autopoietic Digital Ecosystems (OPAALS) 
project [78] to assist further research into Digital Ecosystems, 
including the wider implications of interacting with social 



systems, such as business ecosystems of Small and Medium 
sized Enterprises (SMEs). This will provide the opportunity 
to investigate further the complex system that the Digital 
Ecosystem represents, and eventually collect real world data to 
determine whether the Digital Ecosystem can perform usefully 
in a natural setting. 
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